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Abstract 

In this two-part paper, we consider the multiantenna multihop relay channels in which the source 
signal arrives at the destination through N independent relaying hops in series. The main concern of this 
work is to design relaying strategies that utilize efficiently the relays in such a way that the diversity is 
maximized. In part I, we focus on the amplify-and-forward (AF) strategy with which the relays simply 
scale the received signal and retransmit it. More specifically, we characterize the diversity-multiplexing 
tradeoff (DMT) of the AF scheme in a general multihop channel with arbitrary number of antennas 
and arbitrary number of hops. The DMT is in closed-form expression as a function of the number 
of antennas at each node. First, we provide some basic results on the DMT of the general Rayleigh 
product channels. It turns out that these results have very simple and intuitive interpretation. Then, the 
results are applied to the AF multihop channels which is shown to be equivalent to the Rayleigh product 
channel, in the DMT sense. Finally, the project-and-forward (PF) scheme, a variant of the AF scheme, 
is proposed. We show that the PF scheme has the same DMT as the AF scheme, while the PF can have 
significant power gain over the AF scheme in some cases. In part II, we will derive the upper bound 
on the diversity of the multihop channels and show that it can be achieved by partitioning the multihop 
channel into AF subchannels. 

Index Terms 

Multihop, multiple-input multiple output (MIMO), relay channel, amplify-and-forward (AF), diversity- 
multiplexing tradeoff (DMT). 
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Diversity of MIMO Multihop Relay 
Channels — Part I: Amplify-and-Forward 



I. Introduction and Problem Description 

Wireless relaying systems have lots of advantages over traditional direct transmission systems. 
For example, the periphery can be extended by the relays and the coverage of the existing network 
can be improved. Using relays can also shorten the point to point transmission distance, which 
results in lower power (interference) level or in higher throughput. Furthermore, all these benefits 
can be realized in a more flexible, easier and cheaper to deploy network. 

Recently, there has been a boosting interest in the cooperative diversity with which the spatial 
diversity is exploited through distributed relays. Since the work of Sendonaris et al. [1], [2] 
that introduced the notion of cooperative diversity, a number of relaying protocols have been 
proposed (see, e.g., [3]— [10]). Most of the previous works consider the single- antenna two-hop 
relay channel where the source signal is able to arrive at the destination through at most two 
hops, i.e., the source-relay hop and relay-destination hop. In an iV-relay channel, it is shown 
that a diversity order of N + 1 (respectively, N) can be achieved with (respectively, without) the 
direct source-destination link. 

In this work, we consider the MIMO multihop channel model without direct source-destination 
link. That is, the source signal arrives at the destination through N independent relaying hops in 
series. In the two-hop case, our model is reduced to the model studied by Jing and Hassibi [6]. 
The central concern of our work is to design relaying strategies that utilize efficiently the relays in 
such a way that the diversity is maximized. In part I, we focus on the amplify-and-forward (AF) 
strategy with which the relays simply scale the received signal and retransmit it. The main 
contributions of this paper are as follows. 
1) First, we obtain the diversity-multiplexing tradeoff (DMT) of the Rayleigh product channel, 
whose channel matrix is a product of independent Gaussian matrices. It turns out that each 
Rayleigh product channel belongs to an equivalent class that is uniquely represented by the 
so-called minimal form. Furthermore, based on the closed-form expression of the DMT, we 
derive a recursive DMT characterization that have very simple and intuitive interpretation. 
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2) Then, it is shown that the AF multihop channel is actually equivalent to the Rayleigh product 
channel. We can thus identify the two channels and all previously established results apply 
to the multihop channel. Therefore, the diversity properties of the AF multihop channel 
in terms of the number of hops and the number of antennas in each node are completely 
characterized. We also propose the project-and-forward (PF) scheme, a variant of the AF 
scheme, in the case where full antenna cooperation is possible. It is shown that, although 
the PF scheme has the same DMT as the AF scheme, the PF can have significant power 
gain over the AF scheme in some cases. 

3) Finally, it is pointed out that using less relaying antennas improve the power gain by 
avoiding the hardening of relayed noise, a particular phenomenon in the AF multihop 
channel. And reducing the number of transmit antennas can lower significantly the coding 
delay and decoding complexity. The vertical channel reduction result gives exactly the 
minimum number of antennas we need at each node to keep the same DMT. 

In part II of this paper, we will derive an upper bound on the diversity of the multihop channels 
and show that the AF scheme is not optimal in general. Then, we will proposed both distributed 
and non-distributed schemes that achieve the upper bound. The main idea is to partition the 
multihop channel into AF subchannels. 

The rest of part I is organized as follows. Section [TT] presents the channel model and the AF 
scheme with some basic assumptions. The Rayleigh product channel is introduced and studied in 
section Hill Results concerning the AF and PF schemes are collected in section [IV] In section [VT, 
numerical results on some typical scenarios are shown. Finally, we draw a brief conclusion 
in section |VIJ For fluidity of the presentation, all demonstrations of proofs are delayed to the 
appendices. 

In this paper, we use boldface lower case letters v to denote vectors, boldface capital letters M 
to denote matrices. CM represents the complex Gaussian random variable. [•] T , [•]* respectively 
denote the matrix transposition and conjugated transposition operations. ||-|| is the vector norm. 
(x) + means max(0,x). Det(M) is the absolute value of the determinant det(M). The square 
root y/P of a positive semi-definite matrix P is defined as a positive semi-definite matrix such 
that P = \/P~{\fP) . The ordered eigenvalues of a positive semi-definite matrix P are denoted 
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Fig. 1. A MIMO multihop relay channel. 



by A(P) or /x(P). We define a{P) and /3(P) by 

ai(P)^-logA i (P)/logSNR and A(P) 4 -log /i,(P)/ log SNR. 

And we call them the eigen-exponents of P, with a slight abuse of terminology. We drop the 
arguments of \, fi,a., (3 when confusion is not likely. For any quantity q, 

q = SNR a means lim - ^f'L = a 
SNR->oo log SNR 

and similarly for < and > . The tilde notation n is used to denote the (increasing) ordered 
version of n. Let m and n be two vectors of same length L, then m <n means fhi < hi, Vz. 

II. System Model 

A. Channel Model 

The considered iV-hop relay channel model is illustrated in Fig. [H where there are one 
source (node #0), one destination (node #N), and N — 1 clusters of intermediate relays. Each 
cluster is logically seen as a node (node #1 to node #iV — 1) that is equipped with multiple 
antennas (rii antennas for node We assume that node #i can only hear node #i — 1. 
Mathematically, we have 

V, = HiXi-i + Zi 

where Hi G C niXni_1 is the channel between node #i — 1 and node #i; Xi,y { G C" iXl is the 
transmitted and received signal at node #i; z G C niXl G CA/"(0, 1) is the additive white Gaussian 
noise at node #i. The channels H^s are independent and modeled as Rayleigh quasi-static 
channels, i.e., the entries of Hi are i.i.d. CjV(0, 1) distributed and do not change during the 
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Fig. 2. Amplify-and-forward strategy for multihop channels. 



transmission of a data frame. For simplicity, it is assumed that the intermediate nodes work in 
full-duplej] mode and all transmitting nodes are subject to the same short-term power constraint 

E{||a;,;|| 2 } < SNR, Vz (1) 

where the expectation is taken on the noises. All terminals are supposed to have full channel 
state information (CSI) at the receiver and no CSI at the transmitter. From now on, we denote 
the channel as a (n , ni, . . . , tin) multihop channel. 

B. Amplify -and- Forward Protocol 

The AF strategy is described as follows. At each node, the received signal of each antenna 
is normalized to the same power level and then retransmitted. As shown in Fig. [21 the signal 
model is 

Vi =HiXi + Zi, 
Xi+i = Dy, 

where the transmitted signal rCj has the short-term power constraint 



E(Ni]| 2 )< SNR 



2 \ < . 



n 

the scaling matrix Di G C niXn ' is diagonal with the normalization factors^ 



1 /SNR 

rii 



'The assumption is merely for simplicity of notation. As one can easily verify, since no cross-talk between different channels, 
the half-duplex constraint is directly translated to a reduction of degrees of freedom by a factor of two and does not impact the 
relaying strategy. This is achieved by letting all even-numbered (respectively, odd-numbered) nodes transmit (respective, receive) 
in even-numbered time slot and received (respective, transmit) in odd-numbered time slots. 

2 In the case where long-term power constraint is imposed, we simply replace the channel coefficients \Hi[j, k] \ in lO by l's. 
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C. Diversity-Multiplexing Tradeoff 

In this paper, we use the diversity-multiplexing tradeoff (DMT) as the performance measure. 
Definition 1 (Multiplexing and diversity gains [11]): The multiplexing gain r and diversity 
gain d of a fading channel are defined by 

r 4 , im « and <iA_ lim ^PUSMKR) 
sNR^oologSNR snr-^oo logSNR 

where i?(SNR) is the target data rate and P out (SNR, R) is the outage probability for a target rate 

R. A more compact form is 

P out (SNR,rlogSNR) = SNR~ d . (3) 
Note that in the definition we use the outage probability instead of the error probability, since 
it is shown in [11] that the error probability is dominated by the outage probability in the high 
SNR regime and that the thus defined DMT is the best that we can achieve with any coding 
scheme. 

Lemma 1: The DMT of a n t x n r Rayleigh channel is a piecewise-linear function connecting 
the points (k, d(k)), k — 0, 1, . . . , min (n t , n r ), where 

d{k) = (n t — k)(n r — k). 

III. The Rayleigh Product Channel 

As it is shown in the next section, the AF multihop channels are intimately related to a more 
general Rayleigh product channel defined below. In this section, we investigate the Rayleigh 
product channel and provides some basic results on the diversity. Let us begin by the following 
definitions. 

Definition 2 (Rayleigh product channel): Let H { e c n< - lX "% % — 1, 2, . . . , N, be N indepen- 
dent complex Gaussian matrices with i.i.d. zero mean unit variance entries. A (n ,ni, . . . ,njv) 
Rayleigh product channel is a n N x n MIMO channel defined by 



y = J SNR Ux + z (4) 

V n 1 ---n N 

where II = H\H.i ■ ■ H N ; x is the transmitted signal with power constraint E(||x|| 2 ) < un\ 
z G C n ° xl ~ CA/"(0, 1) is the additive white Gaussian noise; SNR is the receive signal-to-noise 
ratio (SNR) per receive antenna. 



February 1, 2008 



DRAFT 



6 



Definition 3 (Exponential equivalence): Two channels are said to be exponentially equivalent 
or equivalent if their eigen-exponents have the same asymptotical joint pdf. 
Let n be the ordered version of n with h N > un-i > • • • > h . 

Definition 4 (Reduction of Rayleigh product channel): A (m , mi, ... , m^) Rayleigh product 
channel is said to be a reduction of a (n , rii, . . . , n N ) Rayleigh product channel if 1) they are 
equivalent, 2) k < N, and 3) (m , m±, . . . , rrik) ^ (n , ni, . . . , n^). In particular, if k — N, then 
it is called a vertical reduction. Similarly, if to, = fij, Vi G [0, k\, it is a horizontal reduction. 

Definition 5 (Minimal form): (n ,ni, . . . ,n^*) is said to be a minimal form if no reduction 
other than itself exists. Similarly, it is called a minimal vertical form (respectively, minimal 
horizontal form) if no vertical (respectively, horizontal) reduction other than itself exists. A 
channel is said to have order N* if its minimal form is of length N* + 1. 

A. Joint PDF of the Eigen-exponents of IHT 

Theorem 1: Let us denote the non-zero ordered eigenvalues of IIII t by Ai > • • • > A„ min > 
with n min = min rij. Then, the joint pdf of the eigen-exponents a satisfies 

i=0,...,N 

fSNR-^, for <«!<...<«„ 
p(a) = \ (5) 
lSNR"°°, otherwise 



where 



with 



Cj = 1 — i + min 



= (6) 

i=i 

i =0 ni-i 



fc=l,...,JV 



k 



1, . . . , ^miri' (7) 



By definition, n min = n and we interchange the notations depending on the context. From 
the theorem, we can see that the asymptotical eigen-exponents distribution depends only on 
(h , hi, ... , h N ), the ordered version of (n , rii, . . . , n N ). For example, a (3, 1, 4, 2) channel is 
equivalent to a (1,2,3,4) channel, in the eigen-exponent sense. 

Theorem 2: A (n , ni, . . . , tin) Rayleigh product channel can be reduced to a (h , hi, ... , n fc ) 
channel if and only if 

k 

k(h k+ i + l) > (8) 



1=0 
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In particular, it can be reduced to a Rayleigh channel if and only if 



n 2 + 1 > n + h\. 



(9) 



This theorem implies that (ho, hi, . . . , h^*) is a minimal form if there exists no k < N* such 
that © is satisfied. One can also verify that if (h ,hi, . . . , njv*) is a minimal horizontal form 
of (no, ni, . . . , njsf), then 1) it is also a minimal form; and 2) the minimal vertical form is 

(no, hi, ... , h^*,h, . . . , h) where 



Furthermore, note that the order N* is upper-bounded by h because ([8]) is always satisfied with 
k — no- In other words, the length of the minimal form is bounded by h + 1. In particular, the 
minimal form of a (1,%, . . . ,n N ) Rayleigh product channel is always (l,ni), i.e., a 1 x hi or 
rii x 1 Rayleigh channel. 

Theorem 3: Two Rayleigh product channels are equivalent if and only if they have the same 
minimal form. 

From this theorem, we deduce that the class of exponential equivalence is uniquely identified 
by the minimal form. Therefore, N* can also be defined as the order of the class. 

B. Characterization of the Diversity-Multiplexing Tradeoff 

From theorem Q3 we can derive the DMT of a Rayleigh product channel. 
Theorem 4 (Direct characterization): The DMT of a Rayleigh product channel (n , ni, . . . , n N ) 
is a piecewise-linear function connecting the points [k, d[k)), k = 0, 1, ... , n m i n , where 



with Q defined by ©. 

Since the DMT is a bijection of the coefficients q's, all results obtained previously apply to 
the DMT and two Rayleigh product channels are equivalent if and only if they have the same 
DMT. Hence, the exponential equivalence class is also the DMT-equivalence class. However, 
unlike the eigen-exponent, the DMT provides an insight on the diversity performance of a 
channel (or a scheme) for different multiplexing gain. Note that, despite the closed-form nature 
of the characterization (fTTT) . it is lack of intuition. That is why we search for an alternative 
characterization. 



n. 




(ID 



i=k+l 
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Theorem 5 (Recursive characterization): The DMT d(k) defined in (fTTT) can be alternatively 
characterized by 

Rl\k) : d(no,...,t» w )(fc) = d(n o -k,...,n N -k)(0), Vfc; (12) 
R^\i) : rf(n o ,-.,n w )(0) =rnind (nOr .. ) „ i) (j)+% ni+1) ... injv) (0), Vz; (13) 

R ( 3 N \i,k) : d {m ,...,n N )(k) = mmd {no ^ ni) (j) + c%,„ i+1 ,..., njv )(fc), Vz, A;. (14) 
The recursive characterization has an intuitive interpretation as follows. Let us consider A; as a 
"network flow" between the source and the destination and d(k) as the minimum "cost" to limit 
the flow to k (the flow-fc event). In particular, the maximum diversity d(0) can be seen as the 
"disconnection cost". First, R\(k) says that the most efficient way to limit the flow to k is to 
keep a (k, k, . . . , k) channel fully connected and to disconnect the (n — k, n\ — k, . . . , n N — k) 



residual channel, as shown in Fig. |3(a)| Then, /^(O suggests that in order to disconnect a 
(n ,ni, . . . ,n N ) channel, if we allow for j flows from the source to some node i, then the 
(j,n i+1 , . . . ,njv) channel from the j "ends" of the flows at node i to the destination must be 
disconnected. The idea is shown in Fig. |3(b)| Obviously, the most efficient way is such that the 
total cost is minimized with respect to j. This interpretation sheds lights on the typical outage 
event of the Rayleigh product channel. In the trivial case of N = 1 (the Rayleigh channel), there 
is only one subchannel. The typical and only way for the channel to be in outage is that all the 
paths are bad, i.e., the disconnection cost is n x n±. In the non-trivial cases, there are more than 
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one subchannels and thus the typical outage event is not necessarily for one of the subchannels 
being totally bad. The mismatch of two partially bad subchannels can also cause outage. In a 
more general way, the flow-A; event takes place when both the flow-j event in the (no, • • • , n*) 
channel and the flow-A; event in the (j, n i+ i, . . . , n^) channel happen at the same time. We can 
verify that (Ri(k), R 3 (i,k)) is equivalent to (Ri(k) , R 2 (i)) . Note that the DMT is completely 
characterized by these relations in a recursive manner. 

The following corollaries conclude some properties of the DMT of the Rayleigh product 
channel. 

Corollary 1 (Monotonicity): The DMT is monotonic in the following senses : 

1) if (ni,o, ni,i, . . . , Ux,n) h ( n 2,o, n 2 ,i, n 2 ,jv), then 

d(ni i0 ,...,n ltN )( r ) — ^(na,o,— ,n2,Jv) ( r )> 

2) if {ni, ,ni,i, . . -^i,^} 2 {n 2) o,n 2) i, ■ . • ,n 2 jv 2 }, then 

d (ni,o,...,ni,Ar 1 )( r ) < 4^0,...,^ ^ (r) , Vr - 

Corollary 2: Let us define 

ho k = 0, 



Pfe-<Eto^-^fc+i fc = l,...,iV-l, (15) 
-oo k — N. 

Then, 

d(n ,...,niv)( r ) = d(fi ,...,fi fc )(r), for r > p A . 
While corollary Q] implies that d(r) < d(n ,...,n k )(r) in a general way, corollary |2| states precisely 
that d(r) coincides with <2(no,...,n*)( r ) f° r r > Pfc- 
Corollary 3 (Upper bound and lower bound): 

nphi _ _ 

— — < a(0) < n ni 

where d(0) is known as the maximum diversity gain. 

From © and (fTT|) . the upper bound is obtained by setting h 2 large enough and the lower bound 
is obtain by setting h 2 = ... = h N . This corollary implies that the diversity of a Rayleigh 
product channel can always be written as d(0) = ah hi with a 6 (0.5, 1]. Hence, the diversity 
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"bottleneck" of the Rayleigh product channel II is not necessarily one of the subchannels Hi, 
but rather the virtual h x h\ Rayleigh channel. On the other hand, the maximum diversity gain 
is always strictly larger than !H j i , independent of the value N. In order to illuminate the impact 
of N on the DMT, let us consider the symmetric case. 

Corollary 4 (Symmetric Rayleigh product channels): When no = n\ = . . . = tin = n, we 
have 

d(k) = (n-k)(n+l-k) + o|) ((aW _ 1)N + 2m (16) 
where a(k) = [^J and b(k) = (n - k) mod N. 

In the symmetric case, on one hand, we observe that the DMT degrades with N. On the other 
hand, from (fl6l) . the degradation stops at N = n and we have 

d(k) = ( "- fc)( " 2 +1 ' fc) 

for N > n. This can also be deduced from theorem [2] applying which we get that the order of all 
symmetric Rayleigh product channel with N > n is N* = n. Therefore, we lose less than half 
of the diversity gain due to the product of Rayleigh MEMO channels, in contrast to the intuition 
that the maximum diversity gain could degrade to 1 with N — >• oo. As an example, in Fig.|H we 
show the DMT of the 2x2 and 5x5 Rayleigh product channels with different values of N. 

C. General Rayleigh Product Channel 

In fact, we can define a more general Rayleigh product channel as 

U g = H 1 T 12 H 2 ■ ■ H n-\T n-i,nH at. (17) 
Theorem 6: The general Rayleigh product channel is equivalent to 

1) a (no, ni, . . . , njv) Rayleigh product channel, if all the matrices T i i+1 's are square and their 
singular values satisfy aj(T iji+1 ) = SNR°, Vz, j; 

2) a (n ,n[, . . . ,n^ v _ 1 ,n7v) Rayleigh product channel, with n\ being the rank of the matrix 
T iji+ i, if the matrices T^+i's are constant. 

Therefore, the results obtained previously for the Rayleigh product channel can be applied to 
the general one. 
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Fig. 4. Diversity-multiplexing tradeoff of 2 x 2 and 5x5 symmetric Rayleigh product channels. 

IV. Amplify- and-Forward Multihop Channels 

Using the results from the previous section, we are going to analyze the performance of the 
AF scheme presented in section HH in terms of the DMT. 

A. Equivalence to the Rayleigh Product Channel 

With the AF scheme, the end-to-end equivalent MIMO channel is 



where for the sake of simplicity, we define Y[f=i ^% — Aw • -A for any matrices A' s ! H N+1 = I 
and D N = I. The standard whitened form of this channel is 



where z ~ CAf (0,1) is the whitened version of the noise and y/R is the whitening matrix with 
R the co variance matrix of the noise in (fl~8l . Since it can be shown that A max (i2) = A m i n (i2) = 
SNR°, the AF multihop channel is DMT-equivalent to the channel defined by 




(18) 
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Fig. 5. The project-and-forward scheme. 



which is a general Rayleigh product channel defined in (fTTT ) if we have (Tj(-Dj) = SNR°, Vi, j. 
To this end, we slightly modify the matrices D/s and get the new matrices Di with 

Di[j,j] = mm {Di\j,j],K} 

where < k < oo is a constant independent of SNR. Furthermore, it is obvious that the power 
constraint is still satisfied by replacing Di with Therefore, the multihop channel with the 
thus defined AF strategy is DMT-equivalent to a (n , m, . . . , njy) Rayleigh product channel, i.e., 

^min 

d AF (k) = 

i=k+l 

In the rest of the paper, we identify the Rayleigh product channel, the AF multihop channel and 
the vector (n , n 1; . . . , n N ) when confusion is not likely. 

B. A Variant : Project-and-Forward 

We propose a new scheme called project-and-forward (PF), as shown in Fig.|51 This scheme 
can be used only when full antenna cooperation within cluster is possible, that is, all antennas 
in the same cluster are controlled by a central unit. At the node #i, the received signal is first 
projected to the signal subspace Si, spanned by the columns of the channel matrix H_ { . The 
dimension of Si is r i5 the rank of i/j. After the component-wise normalization, the projected 
signal is transmitted using Ti (out of rii) antennas. It is now clear that H^ i+ i £ C ni+lXri is actually 
composed of the r,; columns of the previously defined H i+1 , with r = n . 

More precisely, the Q E C HiXri is an orthogonal basis of Si with Q\Q i = I- We can rewrite 

H { = QG, 

3 The k is only for theoretical proof and is not used in practice, since we can always set k a very large constant but independent 
of SNR. In this case, Di — Di with probability close to 1 for practical SNR. 



February 1, 2008 



DRAFT 



13 



with Gj G C riXri_1 . For simplicity, we let Q be obtained by the QR decomposition of H_ { if 
n-i > fi-i and be identity matrix if n,i < r\. The main idea of the PF scheme is not to use 
more antennas than necessary to forward the signal. Since the useful signal lies only in the 
?vdiniensional signal subspace, the projection of the received signal provides sufficient statistics 
and reduces the noise power by a factor j L . In this case, only antennas are needed to forward 
the projected signal. Let us define = TX$.. Then, as in the AF case, the PF multihop channel 
is DMT-equivalent to the channel defined by 

n PF = H_ N P N ^i ■ ■ H_ 2 PiH_ v 

The following theorem states that using only out of antennas to forward the projected 
signal does not incur any loss of diversity, as compared to the AF scheme. 

Theorem 7: The PF multihop channel is DMT-equivalent to a (n , ni, . . . , un) Rayleigh prod- 
uct channel. 

While the PF and AF have the same diversity gain, the PF outperforms the AF in power gain 
for two reasons. One reason is, as stated before, that the projection reduces the average noise 
power. The other reason is that the accumulated noise in the AF case is more substantial than 
that in the PF case. This is because in the PF case, less relay antennas are used than in the 
AF case. Since the power of independent noises from different transmit antennas add up at the 
receiver side, the accumulated noise in the AF case "enjoys" a larger "transmit diversity order" 
than in the PF case. We call it the noise hardening effect. Some examples will be given in the 
section of numerical results. 

C. Practical Issues 

1 ) Space-Time Coding: From the input-output point of view, the multihop channel with AF/PF 
protocol is merely a linear MIMO fading channel, for which the DMT- achieving space-time codes 
exist. For example, in [11], a Gaussian code is shown to achieve the DMT of a n x n\ Rayleigh 
channel if the code length / > n + n\ — 1. This result can easily be extended to a general 
linear fading channel and one can show that Gaussian coding is DMT-achieving for any fading 
statistics if I is large enough. 

Another family of code construction is based on cyclic division algebra (CDA). These codes 
have minimum length n and are commonly known as the Perfect codes [12], [13]. They are 
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DMT- achieving thanks to the so-called non-vanishing determinant (NVD) property. It has been 
shown that they are approximately universal [13], [14] since they are DMT-achieving for all 
fading statistics. Therefore, we propose to use the rate-n n Q x n Perfect codes. In this case, 
the only information that the source need to know is n . 

2) Antenna Reduction: In the AF case, provided the number of total available antennas 
(n ,ni, . . . , njv), the vertical reduction result gives an exact number of necessary antennas at 
each node in the DMT sense. This result can be used to reduce the number of transmit and relay 

n 

antennas^. If Perfect space-time codes are used, reducing the number of transmit antennas n 
means reducing the coding length, i.e., coding delay and decoding complexity, since the code 
length is equal to the number of transmit antennas. For instance, only two transmit antennas are 
needed in a (4, 2, 2, 2) channel. Therefore, instead of using a 4 x 4 Perfect code the code length 
of which is 4, one can use the Golden code [15] of length 2 and still achieve the DMT. 

In fact, less relay antennas also means less relay signaling (relay probing, synchronization, 
etc.) overhead especially when different antennas are from different relaying terminals (single- 
antenna relays). Furthermore, using more relay antennas hardens the relayed noise. This is the 
same phenomenon as we stated in the PF case. Therefore, the number of relay antennas at each 
node should be restricted to n (defined in (flOl)). the number given by the vertical reduction. 



V. Examples and Numerical Results 

In this section, we provide some examples of multihop channels and show the performance of 
AF scheme with simulation results. In all cases, we make the same assumptions as in section [III 

A. Horizontal and Vertical Reduction 

Outage performances versus the received SNR per node of different multihop channels are 
shown in Fig. [7] Note that both the (2, 2) and (2, 2, 2) channels are minimal and have diversity 
order 4 and 3, respectively. The (3, 2, 2) channel can be horizontally reduced to (2, 2) and thus 
has diversity 4. Similarly, the (2, 2, 2, 2), (4, 2, 2, 2) and (8, 2, 2, 2) channels can be reduced 
to (2, 2, 2) and have diversity 3. As compared to the (2, 2, 2, 2) channel, the larger number of 

4 Reducing the number of receive antennas does not do any good, since more receive antennas always provide larger power 
gain without increasing the complexity. 
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transmit antennas in the (8, 2, 2, 2) weakens the fading of the first hop and the performance is 
close to the (2, 2, 2) channel. 

Another example is to illustrate the vertical reduction of multihop channels, as shown in Fig.[8] 
We first consider the case of a (1,4, 1) channel. The necessary antenna number n is 1 and the 
minimal vertical form is thus (1, 1, 1). We observe that, although both the (1,4, 1) and (1, 1, 1) 
channels have diversity 1, a power gain of 7 dB is obtained at P out = 1CT 4 by using only one 
relay antennas out of four, if the AF scheme is used. As stated in section HV-C.21 the gain is 
due to avoiding the hardening of relayed noise. Then, we consider the (3, 1,4,2) channel. The 
necessary number of antennas n is 2 in this case. As shown in Fig.[8l by restricting the number of 
relay antennas to 2, we have a (3, 1, 2, 2) channel and a gain of 2 dB is observered at P out = 10~ 4 . 
We can further reduce the number of transmit antennas to 2 to get a (2, 1, 2, 2) channel. Unlike 
the reduction of relay antennas, the reduction of transmit antennas does not provide any gain 
because it does not affect the relayed noise. In contrast, it degrades the performance since the first 
hop (2, 1) is faded more seriously than the original first hop (3, 1). Nevertheless, the (2, 1, 2, 2) 
channel is still better than the (3, 1, 4, 2) channel and is only 0.7 dB from the (3, 1, 2, 2) channel. 

B. Project-and-Forward 

In Fig.[9l we compare the PF scheme with the AF scheme for the (1,2,1) and (1,3,2), 
respectively. First of all, note that the AF and the PF have the same diversity order, as predicted. 
Then, a power gain of 8.5 dB (respectively, 6.5 dB) over the AF scheme is obtained by the 
PF scheme in the (1,2,1) (respective, (1,3,2) channel). This is due to the maximum ratio 
combining (MRC) gain in the first hop and to avoiding the relayed noise hardening. 

C. Coded Performance 

We now study the coded performance of the AF multihop channel. The performance measure 
is the symbol error rate (SER) versus the received SNR under the maximum likelihood (ML) 
decoding. We still take the (3, 1, 4, 2) channel as an example. Since n = 1, the diagonal algebraic 
space-time (DAST) codej [16] can be used. As shown in FiglTOl with the DAST code, the symbol 
error rate performances of in the (3, 1, 4, 2), (3, 1, 2, 2) and (2, 1, 2, 2) channels have exactly the 

5 Note that the DAST code is the diagonal version of the rate-one Perfect code proposed in [12]. 
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same behavior as the outage performances of the channels do Fig.[8l Moreover, we can use the 
Alamouti code [17] for the (2, 1, 2, 2) channel. As we can see in the figure, the Alamouti code 
outperforms all the DAST codes with minimum delay and minimum decoding complexity. The 
potential benefits from the vertical reduction are thus highlighted. 



D. Multihop vs. Direct Transmission 

Finally, we introduce the path loss model [18] 

SNR reC eived oc distance~ Q SNR transmitted 

where a is the path loss factor. We fix the distance from the source to the destination and 
dispose the relay nodes on the source-destination line with equal distance. Each node contains 
two antennas. We compare the 2-, 3- and 4-hop channel with the direct transmission (single- 
hop) channel, the performance measure is the transmitted power gain of the multihop channel 
over the single-hop channel at certain target outage probability (1CT 3 and 1CT 4 ). The path loss 
factor a takes the typical values [18] 3, 3.5, and 4 for wireless channels. In Fig. |l 1(a)] the total 



transmission power in the multihop channel is considered. Power gain is obtained for a = 3.5 



and 4. Then, the transmission power per node is considered in Fig . 1 1 1(b)} In this case, power 
gain is obtained for all a and is as high as 11 dB. In practice, the transmission power per node 
also represents the interference level for other terminals which has a significant impact on the 
network capacity. In both figures, the power gain is lower at 10~ 4 than at 1CT 3 . This is due to 
the fact that the direct transmission channel is a 2 x 2 Rayleigh channel and has diversity 4, 
while the multihop channel is (2, 2, ... , 2) and has diversity 3. And low diversity gain means 
decreasing power gain with increasing SNR or equivalently, with decreasing outage probability. 

VI. Conclusion 

Perhaps the simplest relaying scheme in the MIMO multihop channel is the Amplify-and- 
Forward scheme. In part I of this paper, by identifying the AF multihop channel with the so- 
called Rayleigh product channel, we have obtained the complete characterization of the diversity- 
multiplexing tradeoff of the AF scheme in a multihop channel with arbitrary number of antennas 
and hops. The characterization is provided both in direct closed-form and recursive form. Based 
on the DMT, a number of properties of the AF multihop channel have been derived. 
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In the second part, we will show that the AF scheme is suboptimal in general, by establishing 
the diversity upper bound of the multihop channel with any relaying scheme. By partitioning 
the multihop channel into AF subchannels, we achieve the upper bound with both distributed 
and non-distributed schemes. 



The followings are some preliminary results that are essential to the proofs. 

Definition 6 (Wishart Matrix): The m x m random matrix W = HW is a (central) complex 
Wishart matrix with n degrees of freedom and covariance matrix R (denoted as W ~ W m (n, R)), 
if the columns of the m x n matrix H are zero-mean independent complex Gaussian vectors 
with covariance matrix R. 

Lemma 2: The joint pdf of the eigenvalues of W = HW ~ W m (n,i2 mxm ) is identical to 
that of any W' ~ W m '(n, diag(/i 1? . . . , /i m /)) if n x > . . . > jJ, m ' > fi m >+i = . . . = /i m = are 
the eigenvalues of R m xm- 

Proof: Let R = Q f diag(/ii, . . . , /z m /, 0, . . . , 0)Q be the eigenvalue decomposition of R. 
Then, define y/R = Q t diag( v //ii, . . . , ^// m /, 0, . . . , 0)Q and H can be rewritten as H = VRH 
with H having i.i.d. CAf(0, 1) entries. We know that the eigenvalues of HW are identical to 
those of 



where H = QH e C mxn has i.i.d. entries as H does; H e C m xn is composed of the first 
m! rows of H and its entries is thus i.i.d. as well. Finally, we prove the lemma using the fact 
that the eigenvalues of H diag(/ii, . . . , /ji m >)H are identical to those of 



Appendix I 



Preliminaries 



HH = HnRH { 



o 



(Q^o) t diag(/i 1 , . . . , /w, 0, . . . , 0)(Q# ) 
# diag(//i, . . . , /i m /, 0, . . . , 0)#o 
/f diag(/ii, . . .,/jL m/ )H 
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Lemma 3 ([19]-[22]): Let W be a central complex Wishart matrix W ~ W m (n,R), where 
the eigenvalues of R are distinct and their ordered values are [i\ > ... > /i m > 0. Let 
Ai > . . . > \ q > be the ordered positive eigenvalues of W with q = min{m,n}. The joint 
pdf of A conditionned on /i. is 



P (A|m) 



ir mjn Det(H 1 )n^r n_1 Ar m Il 



Ai — Ai 



i<j 



Hi - fJLj 



m n 

G m , n Det(H 2 ) J] _ HiXi - A 



if n > m, 



if n < m, 



(19a) 
(19b) 



with 



S A f e -V.r. and 

L J«j=l 



1 /i r . 



. . ..m-n-l ..m-n-l p — ^ 
rm rm ° 



m— n— 1 , 



(20) 



i^ m , n and G m ,n are normalization factors. In particular, for R = I, the joint pdf is 



(21) 



Now, let us define the eigen-exponents a 
— log/ij/logSNR, i — l,...,m. 
Lemma 4: 



i=l i<j 
A 



logAi/logSNR, i = l,...,q, and & 



Det(Si) 



SNR -^ 1 (a !> 0) j for ( a> ^ G 7^(i) 



.SNR 



otherwise, 



where 



j=l i<j 



3) ' 



(22) 



(23) 



and 



7eW ={«!<...< a m , /3i < . . . < /3 m , and A < ^, for i = 1, . . . , m} . (24) 
Proof: 

Please refer to [8] for details. ■ 



6 In the particular case where some eigenvalues of R are identical, we apply the l'Hospital rule to the pdf obtained, as shown 
in [21]. 
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Lemma 5: 



Det (H 2 ) = 



• SNR -£fe 2 (^) ) for ( Q!) ^) G ^(2) 



(25) 



.SNR 



otherwise, 



where 



£ S2 (a,/3)^(m-n-l)A+ £ ( m -^ l + ]T £J ( tti - ^) + + £ («, - &) + (26) 

i=l i=n+l j'=l i<j j'=n+l i=l 

and 

^ (2) ={«!<...< a n , (3x< ...<(3 m , and A < o<, for % = 1, . . . , n} . (27) 
Proof: First, we have 



Det(S 2 ) = JJ/^-^Det 



i=l 



-(in— n— 1) 



X g-<WyU 



(m— n— 1) 



-A n //i. 



(28) 



Then, let us denote the determinant in the right hand side (RHS) of (1281) as D and we rewrite 
it as 



D = Det 



4 



(m— n— 1) 
m 



,(m-n-l) 
m-l,m 



Det 



4 



(m— n— 1) 



g^^l/^l _ g-^l/Mn 



g^^l/Cm-l g— Al/Mn 



-Ai//i„ 



-Xn/fll p-An//.t 



7) 



(29) 



(m— n— 1) 

TO 



(1) 



Jm-n-1) 
u m— l,m 



"m-lra ° 



-Ai//ji 



-Ai//i„ 



-Xn/fJ.1 



II 



H(l" e- Al/Mm ) (30) 



«=i 



where = /^~ fe — /ij fc and the product term in (|30l) is obtained since 1 — ^(V^m-A,/^) = 
1 — e~ Xi ^ m for all j < m. Let us denote the determinant in (l30l) as _D m . Then, by multiplying 
the first column in _D m with /i™"™ -1 and noting that /i™~ n ~ 1 <i-™~ n_1 ' > = 1 — (/i m /jUj) m_n _1 ~ 1, 
the first column of D m becomes all 1. Now, by eliminating the first m — 2 "l"s of the first 
column by subtracting all rows by the last row as in (|29l and (|30l) . we have ^J l ~ n ~ 1 D m = 
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fllLi (l ~~ e Xi ^ m ) D m _i. By continuing reducing the dimension, we get 



n+l 



Det(S 2 ) = Det [e"Vw] ^ JJ ^—i J] ^ 



i=l i=n+2 

n m 



•n n (i-e- A ^) 

i=l j=n+l 

from which we prove the lemma, by applying (|22|) . ■ 
With the two preceding lemmas, we have the following lemma that provides the asymptotical 

pdf of ck conditionned on (3 in the high SNR regime. 
Lemma 6: 

r SNR -£H/3) ; for ( a , 0) e ft„ifl, 
PH/3) = <^ (3D 
lSNR-°°, otherwise, 

where 

q q q m q 

j=l j=l j=l i<j J = 9+l i = l 

and 

Kcx\f3 -{ai< ... <a q , (3i< ... < P m , and $ < a h for i = 1, . . . , g} . (33) 
Proof: For n > m, applying the variable changes to (|19al ), we have 

m 

p{a\{3) = K Wi „(logSNR)' JJSNR- ( "- m+1) ^SNR- (m -"- 1)ft 

rn 

■ Y[ Yl (SNR~ a < - SNR^)(SNR- ft - SNR - ^) -1 

j=l i<j 

■ Det [exp (-SNR" (Q ^ ft) ) . 

The high SNR exponent of the quantity Det exp ^— SNR~ (Qy_,Si) J is calculated in Lemma HJ 
From (|22l) . we only need to consider a« > Vz, so that p(ct\{3) does not decay exponentially. 
Therefore, we have 

p(a|/3) = SNR-( E -i (n+1 - iK+E -i (i - n - 1)ft+ ^ m =i^< J (^-ft) + ), (34) 
if (a, 13) e TZ {1) and p(a\/3) = SNR~°° otherwise. 
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For n < m, with (|19b| ) and (1251) . we get 



p(a\P) = JJSNR- (m ~ n " 1)ft JJ SNFT 



■{m-i)j3i 



i=l i=n+l 

n m n 



]"[ QSNR " ,! 1 1 [ [ [JSNR i>,! <ir (35) 

j=l i<j j=n+l i=l 



Y[SNR-( n+1 - l)a > JJSNR ( 



JIMI 

i=l i=l 

for (a,/3) G 7£( 2 ) and p(a|/3) = SNR~°° otherwise. Combining the two cases, we prove the 
lemma. ■ 

When R — I, z'.e., [L\ — .. . — /i m = 1, the joint pdf of ex. is found in [11] as shown in the 
following lemma. 

Lemma 7: 

f SNR- E -i (m+n+1 ~ 2i)a \ for ex e TZ a , 
p{oc) = I (36) 
[SNR-°°, otherwise, 

with n a = {0 <ai<... < a,}. 

This lemma can be justified either by using (12TT) or by setting /3j = 0, Vi in (|32l) . 

Lemma 8 ( [23]): Let M be any m x n random matrix and T be any m x m non-singular 
matrix whose singular values satisfy a m i n (T) = cr max (T) = SNR°. Define q = min{m, n} and 
M = TM. Let o\(M) > ... > o-g(M) and oi(M) > . . . > a q (M) be the ordered singular 
values of M and M, Then, we have 

a t (M) = *i(M), Vi. 

Appendix II 
Proof of Theorem [1] 

The following lemma will be used repeatedly in the most of the proofs. 
Lemma 9: Let T k = [p k ,Pk-i], k = 1, . . . , N, be iV consecutively joint intervals with — 
— oo, po = no, and pfc's are defined as in (TT5T) . Then, we have 

d = l-i+ ^ l = ™ 1 ~ 1 , forzGJfc. (37) 
k 

Proof: Ci defined by © is the minimum of iV sequences corresponding to the iV values 
of k. It is enough to show that each of the iV sequences dominates in a consecutive manner. We 
omit the details here. ■ 
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A. Sketch of the Proof 

The proof will be by induction on N. From lemma|71 the theorem is trivial for N = 1. Suppose 
the theorem holds for some N and IT = Hi ■ ■ ■ Hn, we would like to show that it is also true for 
N + 1 and II' = H\ ■ • -Ifjv+i- For simplicity, the "primed" notations (e.g., a.', n', n', d , n' min , 
etc.) will be used for the respective parameters of II'. Note that II' (H') 1 ~ W no (n N+ i,HTV) for a 
given II, since II' = HH N+ i. According to lemma[2l the pdf of the eigenvalues A' of II' (II') is 
identical to that of W nmin (njv+i, diag(A)). Hence, the pdf of a.' can be obtained as the marginal 
pdf of (a', a) 

p(ct') = J p(ol', a)da 

JR n min 

= / p(a'\ct)p(a)da 

= [ SNFT^'^SNFT^da (38) 
Jn 

= SNR-z {a,) (39) 
where d38l) comes from lemma [6] and our assumption that © holds for N, with 

= {o <a[<... < a' <in , <ai<... < a„ min , and a, < a(, for i = 1, . . . , < in | (40) 
being the feasible region; the exponent E(cx') in (|39l) is defined by 

E(a') = min £(<*', a) (41) 
with a) = E(a'\a) + £(a). From Oil) and ©, 



a) = ^(riTV+i - i + l)a- + ^ I (j ; - 1 - tiat+i + c^a,- + ^ (a- - 

i=l j=l \ i<j 



+ E w + '>,)■ ) . (42) 



It remains to show E(a!) = E'(ct') = Qa- with 



c,- = 1 — z + mm 

fc=l....,JV+l 



i = l,...,n min (43) 



by solving the optimization problem (|4T[) . which is accomplished in the rest of the section. 
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1 2 )l mm 1 2 MuBn 1 2 / 

(a) Case 1 (b) Case 2 (c) Case 3 

Fig. 6. For each j, the black dots represent the a''s that are freed by a-j. Therefore, we can get the total number of freed 
a'i by counting the black dots in row i. More precisely, there are [g _1 (i)J — + 1 = |j? -1 (i)J — i black dots for 

i < g(n m in), and n min - + 1 = n min - i black dots for i > g(n min ). 



B. Solving the Optimization Problem 

1) Case 1 [n N+ i < n ]: In this case, we have n' min = n' = n N+ i. Minimization of E(a, a!) 
of (|42l) with respect to (w.r.t.) at can be decomposed into n min minimizations w.r.t. oti, . . . , a nmin 
successively, i.e., 



mm = mm • • • mm . 



We start with a.\. From (|33l) . the feasible region of oi\ is < ot\ < a[. Since the only ai-related 
term in (1421) is (ci — n N+ i)ai and c x — n N+ i > for n^+i < n , we have = 0. Now, suppose 
that the minimization w.r.t. a±, . . . , ct/_i is done and that we would like to minimize w.r.t. aj. 
For atj, j < n' min , we set the initial region as 

o <«;<■•■ < o^j < «j < 

in which we have 52i<j ( a i — = 0. The feasibility conditions in (|40l ) require that a 3 - must 
not go right across a'j. The only choice is therefore to go to the left. Each time ctj goes across 
an a' { from the right to the left, {a[ — dj) + increases by a' { — ctj, which increases the coefficient 
of a[ by 1 and decreases the coefficient of ay by 1. It can be shown that, to minimize the 
value of E(a,ct') w.r.t. ctj, otj is allowed to cross a[ only when the current coefficient of 
dj in (l42l) is positive!! So, ctj stops moving only in the following two cases : 1) it hits the 
left extreme, 0; and 2) its coefficient achieves when it is in the interval [a' k , a' k+1 ) for some 

7 When the coefficient of in J42t is positive, decreasing a t decreases E{ol,ol'). 
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k < j. Either case, a,, -related terms are gone and what remain are the a-'s "freed" by ctj from 
J2i<j ( a 'i ~ a j) + ■ Same reasoning applies to <x, for j > n' min , except that the initial region is set 
to < a[ < •• • < a', < ctj. 

min 

Therefore, the optimization problem can be solved by counting the total number of freed a'/s. 
As shown in Fig. |6(a)[ when j is small, the initial coefficient of <x, is large and thus <x, can 
free out c^_i, . . . ,a[. We have a* = 0, which corresponds to the first stopping condition. For 
large j, the initial coefficient of ctj is not large enough and only • • • , a ' g (j) 
corresponds to the second stopping condition. With the above reasoning, we can get g(j) 

'j ~ 1 - {j ~ 1 - n N+1 + Cj) + 1, for j < n' min , 

for j > n' min . 



aO-i is freed, which 



g(j) = 

From (EH) and ©, we get 



(44) 



n N+ i - Cj + 1, 



g(j) = n N+1 - min 

k=l,...,N 



and 



mm 

k=l,...,N 



k + l 



(45) 



(46) 



Now, E(ot') can be obtaine< 



from Fig. 6(a) 



"min 9(n min ) 

i=l i=l 

9(«min) 



E 



i=fl(n m in)+l 



(l-2z + njv+i+ [p _1 (i)J)a;+ ^ (1-21 + 71^+1 + 

i=l j= g (n min )+l 

9(n m in) 



E 



1 — i + min 

fc=2,...,iV+l 



E;=o g 

k 



El 1 

i=i \ 



2 + mm 

fc=l,...,7V+l 



a[ + (1 - 2z + riiv+i + n min ) a'i 

i=fl(n m in)+l 

(47) 

o£ (48) 

(49) 



In the above minimization procedure, we ignored the feasibility condition aj > otk, Vj > k. A more careful analysis can 
reveal that it is always satisfied with the described procedure. 
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where (1471) is from (1461) and the fact that h' Q = n N+1 , = r^_ l5 I — 1, . . . , N + 1; (1481) can be 
derived from lemma |9l since p[ = n^ + i + n — h\ = g(n min ) and therefore the term mm*; in 
(1481) is dominated by k > 2 for 2 < g(n min ) and by k = 1 for 2 > g(n mm ), corresponding to the 
two terms in (1471) . respectively. 

2) Case 2 [n N+ i e [n ,ni)/.- In this case, we have n ^ in = n min and h[ = n N+1 . From (l42l) . 

"min "min / \ 

S(a / , a) = ^(^+1 - i + l)a[ + I (j ; - 1 - n w+ i + c^a,- + ^ (a- - a^-) 4 " J . (50) 

i=l j=l \ i<j / 

Since j — 1 — ri7v +1 + Cj > 0, Vj < n' min , the minimization of i?(a', a) w.r.t. a is in exactly 
the same manner as in the previous case. Therefore, E{ot') can be obtained from Fig. |6(b)| with 
g(j) in the same form as (|45l) 



"mm 9(n m in) "min 

i=l i=l j= 9 (n min )+l 

= £'(«')■ (51) 

3) Case 3 [n^+i G [ni,oo)/: As in the last case, we have n' min = n min and the same 
E(a', a.) as defined in (l50l) . Without loss of generality, we assume that n N+ i E [n fe . ,n fc » +1 ) for 
some fc* e [1, A/] (we set Un+i — oo). Then, we have 

rlj = rz.;, for I — 1, . . . , k*, (52) 

and 

Vk* < p' k * < Pk*-i = Pk*-i < ■ ■ ■ < Pi = Pv (53) 

Unlike the previous case, j — 1 — n N+ i + Cj is not always positive. Let j be the smallest integer 
such that the coefficient j — 1 — n^+i + Cj of ay in (|5Q|) is zero. It is obvious that for j > j, 
a* = a'j. Hence, we have 

"min l~ l "min 

i=i «=i j=j 



where the second term is from Fig. |6(c)| Furthermore, we can show that j < p' k „, since p' k * — 
1 — n N+ i + c p ' kt = 0. Therfore, we get 

pi* — 1 ni.- 

_ * re mm 

S(a') = ^ (l - 2i + n N+1 + L<? _1 Wj ) «• + ^ (^+1 - 2 + l)aj + ^ CjoJ. (54) 

i=1 »=J i=p' fc * 
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Now, we would like to show that the coefficient of a • in (|54l) coincides with c^. First, for i < j — 1, 
i £ U • • • U Xjy and lemma [9] implies that 



1 — 2i + riN+i + o ^i) I = 1 — i+ min 

fc=2,...,iV+l 



1 — z + min 

fc=l,...,7V+l 



c'. 



Ek ~ i 



k 



l=O n l ~ % 



Then, for z > p' k ,, we have 



ie (T k . u ■ ■ ■ u T[) n (J fc * u ■ ■ • u 



Hence, 



= 1 — i + min 



1 — z + min 



k 

Ek 

k 



(55) 



where (1551 ) is from (|52l) and (|53l) . Finally, for z G [j,/4.)> let us rewrite i = p' k , 
i - 1 - Tijv+i + q = 0, Vz G \j,p' k *), we have 



Aj. Since 



Z)f=o ni-i- k*n N+1 
k* 



= 0, 

from which we have Aj G [0, k* — 1] and 

k* + 1 



+ 1 -z 



Eflo + n N+1 - + Aj 
k* + 1 

1 + - z. 



The proof is complete. 
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C. Proof of Theorem |6| 

To prove the first case, we use induction on N. Suppose that it is true for N, which means 
that the joint pdf of ct(JI g W g ) is the same as that of a^IHI 1 "). Furthermore, we know by lemma[8] 
that ct(IL g T NtN+l T NjN+l 1 lV g ) = a(U g W g ). Same steps as (f38T)(f39T) complete the proof. 

To prove the second statement, we perform a singular value decomposition on the matrices 
T i i+1 's and then apply the first statement. 



Appendix III 
Proof of Theorem [2] and Theorem [3] 



A. Proof of Theorem \2\ 



Let 



c 



(m) A_ 



1 — % + min 

k=l,...,m 



Efc ~ 
l=0 ni-i 



!)■■■) ^min ■ 



What we should prove is that 



,W _ „(*) 



c- ', for % = 1, . . . ,n min 



if and only if ([8]) is true. To this end, it is enough to show that 



C W = c (n i) for i = 1, . . . , n min 



(56) 



if and only if pn-i < N — 1, that is, (N — 1) (un + 1) > J^iLo 1 an d then apply the result 
successively to show the theorem. 

1) The Direct Part: The direct part is to show that, if pjv-i < iV — 1, then (l56l) is true. From 



lemma [9l we see that cf } = cf^, Vz> Pty-i. Hence, when Pn-i < h (EU) holds. Now, let 
us consider the case pn-i > 1- We would like to show that = c^* 1 ' for i e [l,pjv-i]- Let 
j = p/v-i - i G [0,pjv_i — 1]. Then, we rewrite the two quantities 



}2i=o n i - % 



N 



Z2l=0 n i- % 
N - 1 



n N + 



% + 



3_ 

N 



J 



N — I 



(57) 
(58) 



that are identical for Pn-i < N — 1, which proves that cf 1 ^ = c\ N 1 \ The proof for the direct 



part is complete. 
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2) Converse: If Pn-i > N — 1, then from (|57l) and (1581) . we have 7^ cf l ' at least for 
j = N — 1, that is, z = pat_i — (iV — 1). The proof is complete. 

B. Proof of Theorem \3\ 

The direct part of the theorem is trivial. To show the converse, let h = (n , ni, ■ ■ ■ , Un) and 
n = (h^h^, . . . ,h' N ,) be the two concerned minimal forms. In addition, we assume, without 
loss of generality, that 



ri\ — • ■ ■ — Ui x , . . . , Tii M _ 1+ i — ■ ■ ■ — n 



n' = • • • = n[, , . . . , n-/ j.-, = • • • = n'/ 



' 'M'-l +1 *Af' 



with jm < N and i' M , < N'. Now, let us define coi = Q — (1 — i) with q defined in (1371) . It can 
be shown that M intervals are non-trivial with \T ik \ 7^ 0, k = 1, . . . , M. The values of Cqj s are 
in the following form 

F*M I F^Af-ll |^1 1 

^ ^ 



-s . . ... ... 

' ) • • • ) W lM ) !>•••) !)•••) 1 • ' • 1 n iM- \ 1 • • • 1 n 2 !)•••) n l ~l" 1) n l ■ 



Same arguments also apply to n with M' and z', etc. It is then not difficult to see that to have 
exactly the same coi's (thus, same q's), we must have N = N' and 

hi = h[, Vi = 0, . . . , N, 

that is, the same minimal form. 

Appendix IV 
Proof of Theorem [5] 

A. Sketch of the Proof 

To prove the theorem, we will first show the following equivalence relations : 



(R[ N \k),Ri N \i,k)) 


(») 


(R[ N) 


Ri N \i,k) 


(fc) 


R?\ 


(R[ N \k),4 N \N-l)) 


(c) 




(R[ N) (k), Ri N) (i) with ordered n) 


(d) 


(R[ n) 



{R^>{k),R { 2 N) {i) with ordered n); 
(R^^k), R 2 N \n — 1) with ordered and minimal n) 
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1) Equivalences (a) and (b): The direct parts of (a), (b), and (d) are immediate since the 
RHS are particular cases of the left hand side (LHS). To show the reverse part of (a), we rewrite 

d(n ,...,n N )(k) = d( m -k,...,n N -k)(ty (59) 
= m ™d(n -k,..., ni -k)(j) + d(3,n*+i-*,....njv-fc)(°) ( 60 ) 

= mind (n0v .. ini) (j') + d {j , >n . +lr .. )nN) (k) (61) 

3 —k 

where R 1 is used twice in ([59]) and <[6D; R 2 is used in ([60]). As for (b), if Rf r> (N-l, k) holds, 
then 

d(n ,...,n N ) (k) = min d(n o ,...,r W _i)0') + <%™iv) ( 62 ) 
= , miri , d( n0) ..., njv _ 2 )0'') + do'W-oO') + d ij<nN )(k) (63) 

(fc) (64) 

which proves R^ N \n — 2,k). By continuing the process, we can show that R% N \i, k) is true 
for all i, provided R^ n \n — 1, k) holds. 

2) Equivalences (c) and (d): Through (a) and (b), one can verify that the LHS of (c) is 
equivalent to the RHS of (a) of which the RHS of (c) is a particular case. Hence, the direct 
part of (c) is shown. The reverse part of (c) can be proved by induction on N. For N = 2, 

can be shown explicitly using the direct characterization (|TT|) . Now, assuming that 
R { 2 N) (N-1) for non-ordered n, we would like to show that i?^ +1 (iV) holds. Let us write 

mmd {n „,...,„„) 0') + d (j>JV+l) (0) = mind ( ^ ^ )(j) + dy lfli )(0) (65) 

j>0 j>0 

= t TO n d (fio,-,fli-i.ft + d ik ,nN+i)U) + rf(?,ni)(0) (66) 

fc>.j>0 



rf(n ,... 1 n i _ 1 ,n i+ i,...,n JV )(^) + d{k,hi){j') + d(j>,n N+1 ){0) 



(67) 



mXQd(fio,„.,n w )0") + ^0',n iv+1 )(0) 



— ^(no,...,njv+i)(0) 

where the permutation invariance property is used in ((651) ; R$(N — 1, fc) is used in (l66l) since 
we assume that R^ n \N — 1) is trues; fij and rajv+i can be permuted according to R% 2 (1). Finally, 
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we should prove the reverse part of (d), i.e., 

d(n o ,...,n N )(0) = rnmd ( n ,...,n N _ 1 )(j) + jn N 

provided that R^\N — 1) holds for minimal n. 

If n is not minimal, then showing (c) is equivalent to showing 



(68) 



d(n ,...,n N *)(fy = minc^ 



i>o 



n ,...,h N *) 



U)+jn N 



(69) 



where N* is the order of n with fiN* + i < un- Therefore, we should show that the minimum is 
achieved with j = 0. According the direct characterization (fTT|) . this is true only when n N > c\. 
Let us rewrite c\ as 

N* 

N*n N * +1 +p N * 
N* 

Since p^* > N* is always true according to the reduction theorem, we have c\ < n N * + i < h N . 
The rest of this section is devoted to proving that (|68l) holds for minimal n. 



B. Minimal n 

Now, we restrict ourselves in the case of minimal and ordered n, i.e., we would like to prove 

d(n ,...,n N *){0) = mm d (fl0 ,...,n N » ^{j) + jn N . (70) 

Since 



"pn*-i = n N* + 1 -PN*-1 

< n N * + 1 — N* 

< n N *, 



the optimal j is in the interval T N * = \\,p N *_{\. Now, showing (1701) is equivalent to showing 



Pn*-i 



E 



i=l 



£z=o n * ~* 



N* 



Pn*-i 

mm > 1 

Piv*-i>j>0 



■^N*-l ~ 

Ez=o ni-t 
N* - 1 



which, after some simple manipulations, is reduced to 



Pm 

E 





i-1 


~ Pm + 


_M+1_ 



mm 

k 



i=l 



^2(i-PM + 



i - 1 
M 



(71) 
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where we set M = N* — 1 for simplicity of notation. Obviously, the minimum of the RHS of 
dTD) is achieved with such k* that 

k* - 1 



M 



k* 
M 



<0, 
> 0. 



k* - Pm + 

and (k* + 1) - p M + 
Let us decompose k* as k* = aM + b with b G [1, M\. Then, (1721 ) becomes 

aM + b -p M + a < 

which also implies that aN + 1— Pm + a < from which 

Pm ~ 1 
_M + 1 

The form of a suggests that pm can be decomposed as 

p M = a(M+l) + b. 



(72) 
(73) 

(74) 



(75) 



From (1741 ) and ( 1751) , we have b < b and thus b = min {M, 6}. With the form of optimal k and 
some basic manipulations, we have finally 

PM / 1 \ k* 



which ends the proof. 



i - 1 



M + l 



i=l 



i-1 



AI 







Appendix V 
Proof of Theorem [7J 

It can be proved by showing a stronger result : the asymptotical pdf of (^(IIppIIpF) in the high 
SNR regime is identical to that of a: (IP II). We show it by induction on N. For N = 1, since 
H^i = Hi, the result is direct. Suppose that the theorem holds for N. Let us show that it also 
holds for N + 1. Note that 

IIpp = H_ N+1 P N Up F = # Y Q ylll'l 

from which we have 

(n^ F ) f n^ F ~ W no (n N+u (D N Q' N U PF )\D N Q' N I[ PP )) 

- A(( ; D Ar g;np F ) t ( ; D^np F ))) 
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for a given II. Similarly, H n H' ~ Wn min ( n N+i> A(II t II)). In the high SNR regime, we can show 
that 

a ((D N Q^)XD N Q ] N n PP )) = a((Q;n PF ) t (g^n PF )) 

= a(np F n PF ) 

where the first equality comes from lemma [8] and the second one holds because 

(Q;np F ) t (g;n PF )=n PF n PF . 

Finally, since we suppose that the joint pdf of ai((II PF )n PF ) is the same as that of a^lTn), we 
can draw the same conclusion for a(((n pF ) t ) n pF ) and a((II / ) t II'). 
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Received E b /N (dB) 



Fig. 7. Horizontal reduction. 




Received E b /N (dB) 



Fig. 8. Vertical reduction. 
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Received E b /N (dB) 



Fig. 9. AF vs. PF. 




Fig. 10. Symbol error rate of coded performance. 
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Fig. 11. Transmission power gain of the AF multihop channel over the direct transmission. 
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